Unsupervised Discovery of Facial Events: Learning a Dynamic Vocabulary for Facial Analysis
نویسندگان
چکیده
Automatic facial image analysis is a long standing research problem in computer vision. A key component in facial image analysis, largely conditioning the success of subsequent algorithms (e.g. facial expression recognition), is to define a vocabulary of possible dynamic facial events. To date, that vocabulary has come from the anatomically based Facial Action Coding System (FACS) or taxonomies derivative of FACS (e.g., EM-FACS and basic emotions). Each FACS action unit (AU) corresponds to the movements of one or more facial muscles. The aim of this paper is to discover a taxonomy of facial movements directly from video, without recourse to FACS or its derivative. Using unsupervised learning, we discover dynamic facial events directly from video of naturally occurring facial behavior (i.e., not posed) of multiple people. Several issues contribute to the challenge of this task. These include non-frontal pose, moderate to large out-of-plane head motion, large variability in the temporal scale of facial gestures, person variability and the exponential nature of possible facial action combinations. To address these problems, this paper proposes a novel temporal segmentation and multi-subject correspondence algorithm for matching expressions. Our method achieved good convergent validity with manual FACS annotation. To the best of our knowledge, this is the first attempt to learn a dynamic vocabulary of facial events from video.
منابع مشابه
Unsupervised Discovery of Facial Events: Learning a Dynamic Vocabulary for Facial Analysis
Automatic facial image analysis is a long standing research problem in computer vision. A key component in facial image analysis, largely conditioning the success of subsequent algorithms (e.g. facial expression recognition), is to define a vocabulary of possible dynamic facial events. To date, that vocabulary has come from the anatomically based Facial Action Coding System (FACS) or taxonomies...
متن کاملRI:Medium: Unsupervised and Weakly-Supervised Discovery of Facial Events
The face is one of the most powerful channels of nonverbal communication. Facial expression has been a focus of emotion research for over a hundred years [10]. It is central to several leading theories of emotion [17, 30, 47] and has been the focus of at times heated debate about issues in emotion science [18, 24, 43]. Facial expression figures prominently in research on almost every aspect of ...
متن کاملA Nonlinear Grayscale Morphological and Unsupervised method for Human Facial Synthesis Based on an Example Image
Human facial generation of example image is used as a requirement for biometric applications for the purpose of identifying individuals. In this paper, face generation consists of three main steps. In the first step, detection of significant lines and edges of the example image are carried out using nonlinear grayscale morphology. Then, hair areas are identified from the face of sample. The fin...
متن کاملRI:Medium: Unsupervised and Weakly-Supervised Discovery of Facial Events
The face is one of the most powerful channels of nonverbal communication. Facial expression has been a focus of emotion research for over a hundred years [11]. It is central to several leading theories of emotion [16, 28, 44] and has been the focus of at times heated debate about issues in emotion science [17, 23, 40]. Facial expression figures prominently in research on almost every aspect of ...
متن کاملHierarchical Unsupervised Learning of Facial Expression Categories
We consider the problem of unsupervised classification of temporal sequences of facial expressions in video. This problem arises in the design of an adaptive visual agent, which must be capable of identifying appropriate classes of visual events without supervision to effectively complete its tasks. We present a multilevel dynamic Bayesian network that learns the high-level dynamics of facial e...
متن کامل